(54) Title: REGULATION OF EMBRYONIC TRANSCRIPTION IN PLANTS

-950 ACTCATAAAA AC TAG TAGATTGGTT GGTTGGTTTC CATGTACCAG
     AtproFW ->
-900 AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC
-850 CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA
-800 AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA
-750 AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC
-700 CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA
-650 AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT
-600 CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC
-550 CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA
-500 ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT
-450 AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT
-400 GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
-350 ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
-300 TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
-250 TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
-200 TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
-150 CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
-100 GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
-50 GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA
     AtproRV
1 ATG ACGTCCG TTAACGTTAA GCTCCTT



(57) Abstract: Nucleic acid constructs are provided comprising transcriptional regulatory regions homologous to plant FAE1
promoters. In some embodiments, these constructs may be used in transgenic cells or plants to promote expression of foreign and
endogenous genes in developing seeds, for example to affect seed lipid metabolism, protein or carbohydrate composition and
accumulation, or seed development.
REGULATION OF EMBRYONIC TRANSCRIPTION IN PLANTS



FIELD OF THE INVENTION 

The invention is in the field of nucleic acid sequences capable of regulating
transcription, particularly sequences that may promote transcription during embryogenesis in
plants.



BACKGROUND OF THE INVENTION 

Most of the information about seed-specific gene expression comes from studies of
genes encoding seed storage proteins like napin, a major protein in the seeds of Brassica
napus, or conglycinin of soybean. Upstream DNA sequences directing strong embryo-specific
expression of these storage proteins have been used successfully in transgenic plants to
manipulate seed lipid composition and accumulation (Voelker et al., 1996). However,
expression of storage protein genes begins fairly late in embryogenesis. Thus, promoters of
seed storage protein genes may not be ideal for all seed-specific applications. For example,
storage oil accumulation commences significantly before the highest level of expression of
either napin (Stalberg et al., 1996) or conglycinin (Chen et al., 1988) is achieved. It is
therefore of interest to identify other promoters which may modulate expression of genes in
developing plant embryos.

A variety of transcriptional regulatory regions that may be active during plant
embryogenesis are known, as disclosed for example in: U.S. Patent No. 5,792,922 issued 11
August 1998 to Moloney; U.S. Patent No. 5,623,067 issued 22 April 1997 to Vandekerckhove
et al.; International Patent Publication WO9845461 published 15 October 1998. There remains
a need for alternative transcriptional regulatory regions.

FATTY ACID ELONGATION 1 (FAE1) genes encode condensing enzymes involved in
plant very long chain fatty acid biosynthesis. The FAE1 condensing enzyme is thought to be
localized in the endoplasmic reticulum where it catalyzes the sequential elongation of C18
fatty acyl chains to C22 in length (Kunst et al., 1992). FAE1 genes have been cloned and
described recently by James et al. (1995) and in International Patent Publication WO 96/13582.

SUMMARY OF THE INVENTION 

In one aspect, the invention provides transcriptional regulatory regions derived from
FAE1 genes. The transcriptional regulatory regions of the invention may be useful in
promoting early seed-specific transcription of heterologous sequences to which they are
operably linked. The transcriptional regulatory regions of the invention may be used in a wide
variety of plants, including Brassica sp., Arabidopsis and other plant species. DNA constructs
comprising the transcriptional regulatory sequences of the invention may be active during
fatty acid or lipid biosynthesis in the plant embryo. Certain embodiments of the constructs of
the invention may be used in transgenic plants to promote expression of heterologous
sequences in developing seeds. In various embodiments, the constructs of the invention may
be used to mediate gene expression that affects seed lipid metabolism, or seed protein
composition or seed carbohydrate composition, or seed development. In alternative
embodiments, the transcriptional regulatory regions of the invention may also be useful for the
production of modified seeds containing novel recombinant proteins which have
pharmaceutical, industrial or nutritional value.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a 934 bp DNA sequence comprising the Arabidopsis thaliana FAE1
transcription regulatory sequence.

Figure 2 shows a 1588 bp DNA sequence comprising the Brassica napus FAE1
transcription regulatory sequence.

Figure 3 shows a 1069 bp DNA sequence comprising the Lunaria annua FAE1
transcription regulatory sequence.

Figure 4 shows an alignment of the Arabidopsis thaliana (A.t.), Lunaria annua (L.a.)
and Brassica napus (B.n.) transcription regulatory sequences. Asterisks below the sequences
indicate identical nucleotides in each of the three sequences. A number of putative cis-acting
sequence motifs are identified in the A. thaliana sequence: an EMI ABA box at -44 bp to
-36 bp having the sequence ACATCTCAT, for which the published consensus sequence is
ACGTGTCAT (Rowley, D.L. and Herman, E.M. (1997), Biochimica et Biophysica Acta
1345:1-4); an A-300 box at -51 bp to -46 bp having the sequence TGCAAT, for which the
published consensus sequence is TG(T/A/C)AAA(G/T) (Morton et al. (1994) in Seed
Development and Germination (Kigel, J. and Gallili, G., eds.) pp. 103-138, Marcel Dekker,
New York); G-box 1 at -105 to -100 bp having the sequence CACATG, for which the
consensus sequence is CACCTG, and G-box 2 at -164 to -159 bp having the sequence
CAACTT, for which the consensus sequence is CAACTG (Kawogoe, Y. and Murai, N.
(1992) Plant J. 2:927-936); a CE1 element at -226 to -218 bp having the sequence
TTCCATCGA, for which the consensus sequence is TGCCACCGG, and a CE3 element at
-381 bp to -369 bp having the sequence ACACATTCCCTC, for which the consensus sequence
is ACGCGTGTCCTC (Shen et al. (1996) Plant Cell 8:1107-1119). Not highlighted is a
putative RY repeat motif at -53 bp to -47 bp having the sequence CATGCAA, for which the
consensus sequence is CATGCAT (Dickinson et al. (1988) Nucleic Acid Res. 16:371;
Lelievre et al. (1992) Plant Physiol. 98:387-391). Also shown, as Con. 4, is a consensus
sequence, wherein R=G or A, Y=T/U or C, M=A or C, K=G or T/U, S=G or C, W=A or T/U,
B=G or C or T/U, D=A or G or T/U, H=A or C or T/U, V=A or G or C and N=A or G or C or
T/U.

Figure 5 shows an alignment of the Arabidopsis thaliana (A.t.) and Lunaria annua
(L.a.) transcription regulatory sequences. Asterisks below the sequences indicate identical
nucleotides in each of the two sequences. The base at position -400 in the A.t. sequence is
highlighted. The alignment of sequences in both Figure 4 and Figure 5 was accomplished
using the CLUSTALW program (version 1.74) for multiple sequence alignments, using a gap
open penalty of 15, a gap extension penalty of 6.66 and an IUB DNA weight matrix. Also
shown, as Con. 5, is a consensus sequence, wherein R=G or A, Y=T/U or C, M=A or C, K=G
or T/U, S=G or C, W=A or T/U, B=G or C or T/U, D=A or G or T/U, H=A or C or T/U, V=A
or G or C and N=A or G or C or T/U.
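
The ambiguity letters listed for Con. 4 and Con. 5 can be illustrated with a short sketch. This is not the procedure used to produce the figures; it only shows, under stated assumptions, how one letter per alignment column could be derived from the definitions above. The function name `consensus`, the toy input sequences, and the choice to report any column containing a gap as `-` are all assumptions made for illustration.

```python
# Map each set of observed bases to the ambiguity letter defined for
# Con. 4 and Con. 5 (U is treated as T for lookup purposes).
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("GA"): "R", frozenset("TC"): "Y",
    frozenset("AC"): "M", frozenset("GT"): "K",
    frozenset("GC"): "S", frozenset("AT"): "W",
    frozenset("GCT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("AGC"): "V",
    frozenset("AGCT"): "N",
}


def consensus(aligned):
    """Return one ambiguity letter per column of equal-length aligned sequences.

    Assumption: a column containing a gap ('-') is reported as '-'.
    """
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must have equal length")
    out = []
    for column in zip(*aligned):
        bases = {b.upper().replace("U", "T") for b in column}
        out.append("-" if "-" in bases else IUPAC[frozenset(bases)])
    return "".join(out)


# Toy three-sequence alignment (hypothetical, not taken from the figures).
print(consensus(["ACGT-A", "ATGTCA", "AGGTCC"]))  # prints ABGT-M
```

In the toy input, the second column holds C, T and G and is therefore reported as B, and the last column holds A and C and is reported as M.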

Figure 6 includes two bar graphs illustrating hydroxy fatty acid content of A) FAE1-FAH12
and B) napin-FAH12 transgenic seeds, expressed as percentage of total seed fatty
acids.

Figure 7 shows an alignment of the Brassica napus (B.n.) and Lunaria annua (L.a.)
FAE1 transcription regulatory sequences. Asterisks below the sequences indicate identical
nucleotides in each of the two sequences.

Figure 8 shows an alignment of the Brassica napus (B.n.) and Arabidopsis thaliana
(A.t.) FAE1 transcription regulatory sequences. Asterisks below the sequences indicate
identical nucleotides in each of the two sequences.

DETAILED DESCRIPTION OF THE INVENTION 

The recombinant nucleic acid molecules of the invention may comprise a heterologous
promoter sequence operably linked to a nucleic acid sequence, wherein the promoter
sequence comprises a transcriptional regulatory region capable of mediating seed-specific
expression in Arabidopsis. The transcriptional regulatory region may be obtainable from a
plant FAE1 gene. Alternatively, the transcriptional regulatory region may hybridize under
stringent conditions to a 5' region of the plant FAE1 gene. In further alternative
embodiments, the transcriptional regulatory region may be at least 70% identical when
optimally aligned to the 5' region of the plant FAE1 gene.

In alternative embodiments, the invention provides isolated nucleic acids comprising
the transcriptional regulatory regions of the invention. By isolated, it is meant that the
isolated substance has been substantially separated or purified away from other biological
components with which it would otherwise be associated, for example in vivo. The term
'isolated' therefore includes substances purified by standard purification methods, as well as
substances prepared by recombinant expression in a host as well as chemically synthesized
substances.

In the context of the present invention, "transcriptional regulatory region" means a
nucleotide sequence capable of mediating or modulating transcription of a nucleotide
sequence of interest, when the transcriptional regulatory region is operably linked to the
sequence of interest. Conversely, a transcriptional regulatory region and a sequence of interest
are "operably linked" when the sequences are functionally connected so as to permit
transcription of the sequence of interest to be mediated or modulated by the transcriptional
regulatory region. In some embodiments, to be operably linked, a transcriptional regulatory
region may be located on the same strand as the sequence of interest. The transcriptional
regulatory region may in some embodiments be located 5' of the sequence of interest. In such
embodiments, the transcriptional regulatory region may be directly 5' of the sequence of
interest or there may be intervening sequences between these regions. The operable linkage of
the transcriptional regulatory region and the sequence of interest may require appropriate
molecules (such as transcriptional activator proteins) to be bound to the transcriptional
regulatory region; the invention therefore encompasses embodiments in which such molecules
are provided, either in vitro or in vivo.

The term "recombinant" means that something has been recombined, so that when
made in reference to a nucleic acid molecule the term refers to a molecule that is comprised of
nucleic acid sequences that are joined together by means of molecular biological techniques.
The term "recombinant" when made in reference to a protein or a polypeptide refers to a
protein molecule which is expressed using a recombinant nucleic acid molecule. The term
"heterologous" when made in reference to a nucleic acid sequence refers to a nucleotide
sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence
to which it is not ligated in nature, or to which it is ligated at a different location in nature. The
term "heterologous" therefore indicates that the nucleic acid molecule has been manipulated
using genetic engineering, i.e. by human intervention.

Sequences may be derived or obtainable from plant FAE1 genes by deduction and
synthesis based upon the wild-type FAE1 gene sequences. Derived sequences may be
identified in different organisms, for example by isolation using as probes the nucleic acid
sequences of the invention. Alternative transcriptional regulatory regions may be derived
through mutagenesis or substitution of wild-type sequences, such as the sequence disclosed
herein. Derived nucleic acids of the invention may be obtained by chemical synthesis,
isolation, or cloning from genomic DNAs using techniques known in the art, such as the
Polymerase Chain Reaction (PCR). Consensus sequences, such as those illustrated in Figures
4 and 5, are alternative embodiments of the nucleic acids of the invention, derived from the
disclosed wild-type FAE1 gene sequences. Nucleic acids of the present invention may be used
to design alternative primers (probes) suitable for use as PCR primers to amplify particular
regions of an FAE1 gene. Such PCR primers may for example comprise a sequence of 15-20
consecutive nucleotides of the sequences of the invention. To enhance amplification
specificity, primers of 20-30 nucleotides in length may also be used. Methods and conditions
for PCR amplification are described in Innis et al. (1990); Sambrook et al. (1989); and
Ausubel et al. (1995). As used herein, the term "probe" when made in reference to an
oligonucleotide refers to an oligonucleotide which is capable of hybridizing to another
oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are, for
example, useful in the detection, identification, amplification and isolation of particular gene
sequences. Oligonucleotide probes may be labelled with a "reporter molecule," so that the
probe is detectable using a detection system, such as enzymatic, fluorescent, radioactive or
luminescent detection systems.

Derived nucleic acids of the invention may also be identified by hybridization, such as
Southern or Northern analysis. Southern analysis is a method by which the presence of DNA
sequences in a target nucleic acid mixture is identified by hybridization to a labeled probe,
comprising an oligonucleotide or DNA fragment of a nucleic acid of the invention. Probes for
Southern analysis may for example be at least 15 nucleotides in length. Southern analysis
typically involves electrophoretic separation of DNA digests on agarose gels, denaturation of
the DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose, nylon, or
another suitable membrane support for analysis with a radiolabeled, biotinylated, or
enzyme-labeled probe as described in Sambrook et al. (1989). Similarly, Northern analysis may be
used to identify RNAs that hybridize to a known probe such as an oligonucleotide, DNA
fragment, cDNA or fragment thereof, or RNA fragment of a nucleic acid of the invention or a
known FAE1 sequence. The probe may be labeled with a radioisotope such as 32P, by
biotinylation or with an enzyme. The RNA to be analyzed may be electrophoretically
separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, nylon, or other
suitable membrane, and hybridized with the probe, using standard techniques well known in
the art such as described in Sambrook et al. (1989).

In alternative embodiments, a transcriptional regulatory region of the invention may
be at least 70% identical when optimally aligned to the 5' region of a plant FAE1 gene, such
as the Arabidopsis FAE1 gene. In alternative embodiments, the degree of identity may be
between 50% and 100%, such as 60%, 80%, 90%, 95% or 99%. When a position in the
compared sequence is occupied by the same nucleotide or amino acid, following optimal
alignment of the sequences, the molecules are considered to have identity at that position. The
degree of identity between sequences is a function of the number of matching positions shared
by the sequences. In terms of percentage, identity is the sum of identical positions, divided by
the total length over which the sequences are aligned, multiplied by 100.
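
The percentage calculation described above can be written out as a minimal sketch. It assumes the two sequences have already been optimally aligned by some other tool and are supplied as equal-length strings with `-` marking gaps. The function name `percent_identity`, the example strings, and the choice to count gap columns in the aligned length are illustrative assumptions, not part of the disclosure.

```python
def percent_identity(aligned_a, aligned_b):
    """Identical positions / aligned length * 100 for two pre-aligned sequences.

    Assumption: gap columns ('-') count toward the aligned length but are
    never counted as identical positions.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("inputs must be non-empty and of equal length")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * identical / len(aligned_a)


# Hypothetical 10-column alignment: 8 identical positions, 1 mismatch, 1 gap.
print(percent_identity("ACGTACGT-A", "ACGAACGTTA"))  # prints 80.0
```

Under these assumptions the example alignment is 80% identical, which would satisfy the "at least 70% identical" criterion stated above.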

Various aspects of the present invention encompass nucleic acid or amino acid
sequences that are homologous to other sequences. As the term is used herein, an amino acid
or nucleic acid sequence is "homologous" to another sequence if the two sequences are
substantially identical and the functional activity of the sequences is conserved (for example,
both sequences function as or encode a FAE1 enzyme; as used herein, the term 'homologous'
does not imply evolutionary relatedness). Nucleic acid sequences may also be homologous if
they encode substantially identical amino acid sequences, even if the nucleic acid sequences
are not themselves substantially identical, a circumstance that may for example arise as a
result of the degeneracy of the genetic code.

Two amino acid or nucleic acid sequences are considered substantially identical if,
when optimally aligned (with gaps permitted), they share at least about 50% sequence
similarity or identity, or if the sequences share defined functional motifs. In alternative
embodiments, sequence similarity in optimally aligned substantially identical sequences may
be at least 60%, 70%, 80%, 90% or 95%. As used herein, a given percentage of homology
between sequences denotes the degree of sequence identity in optimally aligned sequences.

Optimal alignment of sequences for comparisons of similarity may be automated using
a variety of algorithms, such as the local homology algorithm of Smith and Waterman (1981)
Adv. Appl. Math 2: 482, the homology alignment algorithm of Needleman and Wunsch (1970)
J. Mol. Biol. 48:443, the search for similarity method of Pearson and Lipman (1988) Proc.
Natl. Acad. Sci. USA 85: 2444, and the computerized implementations of these algorithms
(such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package,
Genetics Computer Group, Madison, WI, U.S.A.). Sequence similarity may also be
determined using the BLAST algorithm, described in Altschul et al. (1990), J. Mol. Biol.
215:403-10 (using the published default settings). Software and instructions for performing
BLAST analysis may be available through the National Center for Biotechnology Information
in the United States (including the programs BLASTP, BLASTN, BLASTX, TBLASTN and
TBLASTX that may be available through the internet at http://www.ncbi.nlm.nih.gov/). The
BLAST algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying
short words of length W in the query sequence that either match or satisfy some
positive-valued threshold score T when aligned with a word of the same length in a database
(reference) sequence. T is referred to as the neighborhood word score threshold. Initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs. The word hits
are extended in both directions along each sequence for as far as the cumulative alignment
score can be increased. Extension of the word hits in each direction is halted when any of the
following conditions is met: the cumulative alignment score falls off by the quantity X from
its maximum achieved value; the cumulative score goes to zero or below, due to the
accumulation of one or more negative-scoring residue alignments; or the end of either
sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity
and speed of the alignment. The BLAST program may use as defaults a word length (W) of
11, the BLOSUM62 scoring matrix (Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA
89: 10915-10919), a gap existence cost of 11, a per residue gap cost of 1, a lambda ratio of
0.85, alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both
strands. One measure of the statistical similarity between two sequences using the BLAST
algorithm is the smallest sum probability (P(N)), which provides an indication of the
probability by which a match between two nucleotide or amino acid sequences would occur
by chance. In alternative embodiments of the invention, nucleotide or amino acid sequences
are considered substantially identical if the smallest sum probability in a comparison of the
test sequences is less than about 1, preferably less than about 0.1, more preferably less than
about 0.01, and most preferably less than about 0.001. In the PSI-BLAST implementation of
the BLAST algorithm, an expect value for inclusion in PSI-BLAST iteration may be 0.001
(Altschul et al. (1997), Nucleic Acids Res. 25:3389-3402). Searching parameters may be
varied to obtain potentially homologous sequences from database searches.
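
The three halting conditions for word-hit extension described above can be sketched in a few lines. The following is a simplified, ungapped, rightward-only illustration and is not the BLAST implementation itself. The function name `extend_right`, the toy sequences, and the drop-off value `x_drop=20` are hypothetical; reading M=5 and N=4 as a match reward of 5 and a mismatch penalty of 4 is an assumption, as is treating the seed word as an exact match.

```python
def extend_right(query, subject, q_pos, s_pos, word_len,
                 match=5, mismatch=-4, x_drop=20):
    """Extend an exact word hit to the right without gaps.

    Returns (length, score) of the best-scoring extension found.
    Scoring values and x_drop are illustrative assumptions.
    """
    score = best = word_len * match  # the seed word itself is an exact match
    best_len = word_len
    offset = word_len
    # Halting condition: the end of either sequence is reached.
    while q_pos + offset < len(query) and s_pos + offset < len(subject):
        same = query[q_pos + offset] == subject[s_pos + offset]
        score += match if same else mismatch
        offset += 1
        if score > best:
            best, best_len = score, offset
        # Halting condition: score fell off by X from its maximum, or
        # halting condition: cumulative score went to zero or below.
        if best - score >= x_drop or score <= 0:
            break
    return best_len, best


# Toy sequences: 12 matching bases followed by 8 mismatches.
query = "ACGTACGTACGTTTTTTTTT"
subject = "ACGTACGTACGTAAAAAAAA"
print(extend_right(query, subject, 0, 0, word_len=4))  # prints (12, 60)
```

In this toy case the score peaks at 60 after 12 matching bases, and the extension stops after five consecutive mismatches, when the score has dropped by 20 from that maximum.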

An alternative indication that two nucleic acid sequences are substantially identical is
that the two sequences hybridize to each other under moderately stringent, or preferably
stringent, conditions. Hybridization to filter-bound sequences under moderately stringent
conditions may, for example, be performed in 0.5 M NaHPO4, 7% sodium dodecyl sulfate
(SDS), 1 mM EDTA at 65°C and washing in 0.2 x SSC/0.1% SDS at 42°C (see Ausubel et
al. (eds), 1989, Current Protocols in Molecular Biology, Vol. 1, Green Publishing Associates,
Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3). Alternatively, hybridization to
filter-bound sequences under stringent conditions may, for example, be performed in 0.5 M
NaHPO4, 7% SDS, 1 mM EDTA at 65°C, and washing in 0.1 x SSC/0.1% SDS at 68°C (see
Ausubel et al. (eds), 1989, supra). Hybridization conditions may be modified in accordance
with known methods depending on the sequence of interest (see Tijssen, 1993, Laboratory
Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic Acid
Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic
acid probe assays", Elsevier, New York). Generally, stringent conditions are selected to be
about 5°C lower than the thermal melting point for the specific sequence at a defined ionic
strength and pH.

A FAE1 promoter is any naturally occurring transcriptional regulatory region that 
mediates or modulates the expression of a plant FAE1 condensing enzyme. Plant FAE1 
condensing enzymes are proteins that are homologous to known FAE1 condensing enzymes, 
such as those cloned and described in International Patent Publication WO 96/13582. 

Heterologous DNA sequences may for example be introduced into a host cell by
transformation. Such heterologous molecules may include sequences derived from the host
cell species, which have been isolated and reintroduced into cells of the host species.
Heterologous nucleic acid sequences may become integrated into a host cell genome, either as
a result of the original transformation of the host cells, or as the result of subsequent
recombination events. Transformation techniques that may be employed include plant cell
membrane disruption by electroporation, microinjection and polyethylene glycol based
transformation (such as are disclosed in Paszkowski et al., EMBO J. 3:2717 (1984); Fromm et
al., Proc. Natl. Acad. Sci. USA 82:5824 (1985); Rogers et al., Methods Enzymol. 118:627
(1986); and in U.S. Patent Nos. 4,684,611; 4,801,540; 4,743,548 and 5,231,019); biolistic
transformation such as DNA particle bombardment (for example as disclosed in Klein et al.,
Nature 327: 70 (1987); Gordon-Kamm et al., "The Plant Cell" 2:603 (1990); and in U.S.
Patent Nos. 4,945,050; 5,015,580; 5,149,655 and 5,466,587); and Agrobacterium-mediated
transformation methods (such as those disclosed in Horsch et al., Science 233: 496 (1984);
Fraley et al., Proc. Natl. Acad. Sci. USA 80:4803 (1983); and U.S. Patent Nos. 4,940,838 and
5,464,763).

Standard methods are available for the preparation of constructs for use in identifying
and characterizing transcriptional regulatory regions useful in various embodiments of the
invention. General molecular techniques may for example be performed by procedures
generally described by Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith
JA, Struhl K. (1995) Current Protocols in Molecular Biology, Vols 1, 2 and 3. Alternative
equivalent methods or variations thereof may be used in accordance with the general
knowledge of those skilled in this art and the functional requirements of the present invention.

In some aspects of the invention, transformed plant cells may be cultured to regenerate
whole plants having a transformed genotype and displaying a desired phenotype, as for
example modified by the expression of a heterologous protein mediated by a transcriptional
regulatory region of the invention. A variety of plant culture techniques may be used to
regenerate whole plants, such as are described in Gamborg and Phillips, "Plant Cell, Tissue
and Organ Culture, Fundamental Methods", Springer Berlin, 1995; Evans et al., "Protoplasts
Isolation and Culture", Handbook of Plant Cell Culture, Macmillan Publishing Company,
New York, 1983; or Binding, "Regeneration of Plants, Plant Protoplasts", CRC Press, Boca
Raton, 1985; or in Klee et al., Ann. Rev. of Plant Phys. 38:467 (1987). A cell, tissue, organ, or
organism into which a foreign nucleic acid has been introduced is considered "transformed",
"transfected", or "transgenic". A transgenic or transformed cell or organism also includes
progeny of the cell or organism and progeny produced from a breeding program employing a
transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from the
presence of a recombinant nucleic acid construct. A transgenic plant is therefore a plant that
has been transformed with a heterologous nucleic acid, or the progeny of such a plant that
includes the transgene. The invention provides vectors, such as vectors for transforming plants
or plant cells. The term "vector" in reference to a nucleic acid molecule generally refers to a
molecule that may be used to transfer a nucleic acid segment(s) from one cell to another. One
of skill will recognize that after the nucleic acid is stably incorporated in transgenic plants and
confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a
number of standard breeding techniques may be used, depending upon the species to be
crossed.

In various embodiments, the invention comprises plants transformed with the nucleic
acids of the invention. In some embodiments, such plants will exhibit altered fatty acid
content in one or more tissues. These aspects of the invention relate to all higher plants,
including monocots and dicots, such as species from the genera Fragaria, Lotus, Medicago,
Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus,
Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon,
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus,
Asparagus, Antirrhinum, Heterocallis, Nemesia, Pelargonium, Panicum, Pennisetum,
Ranunculus, Senecio, Salpiglossis, Cucumis, Browallia, Glycine, Lolium, Zea, Triticum,
Sorghum, and Datura. Such plants may include maize, wheat, rice, barley, soybean, beans,
rapeseed, canola, alfalfa, flax, sunflower, cotton, clover, lettuce, tomato, cucurbits, potato,
carrot, radish, pea, lentils, cabbage, broccoli, Brussels sprouts, peppers, apple, pear, peach,
apricot, carnations and roses. More specifically, in alternative embodiments, plants for which
the invention may be used in modifying fatty acid content include oil crops of the Cruciferae
family: canola, rapeseed (Brassica spp.), crambe (Crambe spp.), honesty (Lunaria spp.),
lesquerella (Lesquerella spp.), and others; the Compositae family: sunflower (Helianthus
spp.), safflower (Carthamus spp.), niger (Guizotia spp.) and others; the Palmae family: palm
(Elaeis spp.), coconut (Cocos spp.) and others; the Leguminosae family: peanut (Arachis
spp.), soybean (Glycine spp.) and others; and plants of other families such as maize (Zea spp.),
cotton (Gossypium sp.), jojoba (Simmondsia sp.), flax (Linum sp.), sesame (Sesamum spp.),
castor bean (Ricinus spp.), olive (Olea spp.), poppy (Papaver spp.), spurge (Euphorbia spp.),
meadowfoam (Limnanthes spp.), mustard (Sinapis spp.) and cuphea (Cuphea spp.).

Nucleic acids of the invention may also be used as a plant breeding tool, as molecular
markers to aid in plant breeding programs. Such techniques would include using the gene
itself as a molecular probe, or using the DNA sequence to design PCR primers for
PCR-based screening techniques in plant breeding programs.

Deletion or insertion constructs may be useful for domain mapping to determine the
functional domains or motifs of a transcriptional regulatory region derived from a FAE1 gene.
An aspect of the invention is the construction and testing of such constructs, as described
below for the 5' deletion construct of the A. thaliana FAE1 5' region. One aspect of the
invention comprises transcriptional regulatory regions that are derived from functionally
important regions of a FAE1 promoter. As outlined above, the functionally important regions
of a FAE1 promoter may be determined through routine assays. Alternatively, randomly
selected portions of a FAE1 promoter may be selected for use in routine assays to determine
whether the selected region is capable of functioning as a transcriptional regulatory region in
the context of the present invention. In various embodiments, regions of the Arabidopsis
thaliana, Brassica napus or Lunaria annua promoters may be used. For example, the
following motifs in the A.t. FAE1 promoter may be used alone or in combination in novel
transcriptional regulatory regions (see Figure 4): the CE-like elements (CE1 and CE3), the RY
repeat motif, the G-boxes (G-box1 and G-box2), the A-300 box, the EMI ABA box, or the
CTATTTTG element. Constructs of the invention comprising such motifs, deletions or
insertions may be assayed for activity as transcriptional regulatory regions of the invention by
testing for strong seed-specific activity providing expression of a sequence of interest (such as
a reporter sequence) before the torpedo stage and persisting throughout embryo development,
in accordance with standard testing methods that may be adapted from the methods disclosed
herein.

Alternative embodiments of the transcriptional regulatory regions of the invention may
be identified using information available through NCBI databases at http://www.ncbi.nih.gov.

In various embodiments, transcriptional regulatory regions derived from plant FAE1
genes are shown to be capable of directing expression of desired genes at an early stage of
development in a seed-specific manner in disparate plant species. In particular embodiments,
the transcriptional regulatory regions of the invention may be used in a wide variety of
dicotyledonous plants for modification of the seed phenotype. For example, new seed
phenotypes may include:

(1) altered seed fatty acid composition or seed oil composition and accumulation

(2) altered seed protein or carbohydrate composition or accumulation

(3) enhanced production of desirable endogenous seed products

(4) suppression of production of undesirable gene products using antisense, co-suppression or
ribozyme technologies

(5) production of novel recombinant proteins for pharmaceutical, industrial or nutritional
purposes
Isolation of a seed-specific promoter from A. thaliana

Using the sequence information of the A. thaliana genome sequencing project,
synthetic oligonucleotide primers were designed to amplify the FAE1 5' untranslated region
and isolate it by PCR. As shown in Figure 1, the upstream primer
5'-CTAGTAGATTGGTTGGTTGGTTTCC-3' (AtproFW) in combination with the downstream
primer 5'-TGCTCTGTTTGTGTCGGAAAATAATGG-3' (AtproRV) was used, and resulted
in the synthesis of a fragment of the correct size (934 bp). The amplified product was
subcloned in the HincII site of the plasmid pT7T3-18U (Pharmacia) to produce plasmid
pT7T3-18U/proFAE900, followed by complete sequence determination of both strands to
verify the fragment identity. A BLAST search of the A. thaliana Database identified a single
BAC clone T4L20 (GenBank ATF10M6) 125,179 bp long, which contains the complete
FAE1 gene.
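The relationship between a downstream primer and the strand shown in Figure 1 can be checked with a short script. The sketch below is illustrative only: the helper names `reverse_complement` and `amplicon_length` are assumptions of this example, and only the last 50 nucleotides of the Figure 1 sequence (the row labelled -50) are used as a template. It shows that the reverse complement of AtproRV matches the 3' end of the promoter immediately upstream of the ATG.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward, reverse):
    """Length of the product spanned by a forward primer (top strand) and a
    reverse primer (bottom strand) on template, or None if either is absent."""
    template = template.upper()
    start = template.find(forward.upper())
    site = reverse_complement(reverse)  # top-strand sequence bound by the reverse primer
    end = template.find(site, start if start != -1 else 0)
    if start == -1 or end == -1:
        return None
    return end + len(site) - start


ATPRO_RV = "TGCTCTGTTTGTGTCGGAAAATAATGG"
# Row labelled -50 of Figure 1 (the 50 nt immediately upstream of the ATG).
ROW_MINUS_50 = "GCAAATACATCTCATAGCTTCCTCCATTATTTTCCGACACAAACAGAGCA"

if __name__ == "__main__":
    print(reverse_complement(ATPRO_RV))
    print(ROW_MINUS_50.endswith(reverse_complement(ATPRO_RV)))
```

Supplying the full Figure 1 sequence and AtproFW to `amplicon_length` would be one way to confirm the expected product size before ordering primers.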

Functional analysis of the FAE1 5' upstream region

5' upstream fragments of the FAE1 gene were shown to confer seed-specific and
temporally regulated gene expression in plants. A translational fusion was made between the
FAE1 5' region and the coding region of the reporter gene β-glucuronidase (GUS). The
chimeric gene (pFAE900-GUS or pFAE400-GUS) was transferred into Arabidopsis and
tobacco, and GUS activity was monitored in various tissues of transgenic plants.

Construction of the vectors pFAE900-GUS and pFAE400-GUS, and transformation of
Arabidopsis and tobacco, was as follows. The insert was cleaved out of the pT7T3-18U vector
with HindIII and XbaI and directionally subcloned into the corresponding sites of the binary
Ti plasmid pBI101 (Clontech), which contains a promoterless GUS gene (Jefferson et al.,
1987), to obtain the vector pFAE900-GUS. Another construct, pFAE400-GUS, containing
only 393 bp of the 5' FAE1 region directly upstream of the ATG initiation codon fused to the
GUS coding sequence, was also generated. For that, the pT7T3-18U/proFAE900 vector was
digested with BglII and PstI, the sticky ends were filled in using T4 DNA polymerase,
followed by re-ligation to obtain pT7T3-18U/proFAE400. The 393 bp 5' FAE1 upstream
fragment was then excised with HindIII and XbaI and cloned into the binary vector pBI101 to
obtain the plasmid pFAE400-GUS. The pFAE400-GUS and pFAE900-GUS fusion constructs
in pBI101 were introduced into Agrobacterium tumefaciens strain GV3101 (Koncz and
Schell, 1986) by heat-shock and selected for resistance to kanamycin (50 µg/ml). A. thaliana
(L.) Heynh. ecotype Columbia was transformed with the pFAE400-GUS and pFAE900-GUS
constructs using the floral dip method (Clough and Bent, 1998). Screening for transformed seed
was done on 50 µg/mL kanamycin as described previously (Katavic et al., 1994).
Approximately 100 transgenic lines were generated for each construct.
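The 393 bp figure for the shorter construct can be related to the position of the BglII recognition site (AGATCT) in the promoter. The following sketch is an illustration under stated assumptions, not part of the disclosed method: `upstream_fragment_length` is a hypothetical helper, and the template is the 400 nucleotides immediately upstream of the ATG, transcribed from the rows labelled -400 to -50 in Figure 1.

```python
BGLII_SITE = "AGATCT"

# The 400 nt immediately upstream of the ATG (Figure 1, rows -400 to -50).
PROXIMAL_400 = "".join("""
GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA
""".split())


def upstream_fragment_length(promoter, site=BGLII_SITE):
    """Distance from the first base of the most ATG-proximal recognition site
    to the end of the promoter (taken here to end just before the ATG)."""
    idx = promoter.upper().rfind(site)
    if idx == -1:
        return None
    return len(promoter) - idx


if __name__ == "__main__":
    print(upstream_fragment_length(PROXIMAL_400))
```

The value is counted from the first base of the recognition site; the exact junction produced after fill-in and re-ligation is not modelled here.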

For transformation of tobacco, A. tumefaciens harbouring the pFAE900-GUS construct
was co-cultivated with leaf pieces of Nicotiana tabacum SR1 and transformants were selected
with kanamycin (100 µg/mL) on solid medium (Lee and Douglas, 1996).

Histochemical localization of GUS activity and analysis of transgenic plants was as
follows. Tissue sections were placed in 100 mM NaPO4 (pH 7) and 1 mM spermidine for 15
min, then incubated at 37°C in 0.5 K3[Fe(CN)6], 0.01% Triton X-100, 1 mM EDTA, 10 mM
β-mercaptoethanol, 5-bromo-4-chloro-3-indolyl-β-D-glucuronide in 100 mM NaPO4 (pH 7),
until a blue color appeared (after approximately 1 hr). Following incubation with the substrate,
chlorophyll was removed from the sections using a graded ethanol series.

Using this assay, five independent transgenic Arabidopsis lines were examined for the
embryo-specific expression of the GUS gene. In addition, leaf, stem and siliques were
histochemically stained for β-glucuronidase activity. The results indicate that the reporter
gene fused to the transcriptional regulatory region of the invention is not expressed in
vegetative tissues, whereas it is highly expressed in developing seeds (embryos). Both the 934
bp and the 393 bp transcriptional regulatory regions derived from the A.t. FAE1 gene caused
the appearance of GUS activity by the torpedo stage embryo (6 days after flowering). GUS
activity in all five lines persisted throughout subsequent embryo development.

Leaves, stems, pods and seeds of three regenerated tobacco lines transformed with the
pFAE900-GUS construct were also assayed for β-glucuronidase activity. The results obtained
indicate that the 934 bp FAE1 promoter fragment contains sufficient information to direct
seed-specific expression of a reporter gene in transgenic tobacco. Thus the transcriptional
regulatory regions of the invention may be used for seed-specific expression of foreign genes
in transgenic plants.

The in vivo activity of a FAE1 promoter of the invention was compared to the activity
of the napin promoter by expressing the castor bean hydroxylase gene FAH12 (Broun and
Somerville, 1997) behind either the FAE1 promoter (a transcriptional regulatory region of
approximately 1 kb) or the napin promoter in an Arabidopsis fad2/fae1 double mutant. This
mutant accumulates as a proportion of fatty acids about 85% of the 18:1 acyl group, which is
the substrate for the hydroxylase. The levels of hydroxylated fatty acids accumulating in a
large number of independent transgenic lines were used to estimate the relative strength of
each promoter. As shown in Figure 6, the two populations of transgenic plants accumulated
hydroxylated fatty acids at levels ranging from 0.2% to about 11-12% of total fatty acids,
with the levels being on average slightly higher in FAE1-FAH12 lines. Similarly, the best
FAE1-FAH12 plant accumulated just over 12% of hydroxylated fatty acids (w/w of total FAs),
whereas the best napin-FAH12 plant produced 10.8% of hydroxylated fatty acids (w/w of total
FAs). These results indicate that the FAE1 promoter is highly active in transgenic Arabidopsis
and that its in vivo activity may be superior to napin in Arabidopsis seeds.

Sequence elements or motifs that confer both tissue specificity and developmental
regulation of transcription reside within 393 bp of the AUG translation initiation codon in the
A.t. FAE1 gene. The seed-specific expression conferred by the transcriptional regulatory
regions of the invention is independent of the native terminator of the FAE1 gene 3' end. For
example, in the exemplified constructs disclosed herein, a terminator derived from the
Agrobacterium nopaline synthase gene was used.

Lunaria annua and Brassica napus FAE1 5' regulatory regions

Two sequences originating from B. napus and L. annua were isolated and
characterized to demonstrate that regulatory regions conferring seed-specific transcription
early in embryo development can also be found upstream of other plant FAE1 genes.
Sequences were cloned using the technique of polymerase chain reaction (PCR) walking on
uncloned plant genomic DNA (Devic et al., 1997). Approximately 5 µg of genomic DNA
from 1 g of fresh tissue was used for the construction of 5 different libraries by digesting DNA
with a series of enzymes that produce blunt end fragments to which special adaptors are
ligated. The adaptor molecules consist of a long upper strand, which contains successive
sequences common to the adaptor primers, AP1 and AP2, annealed at its 3' end to a shorter
strand lacking the AP1 sequence. However, this short strand possesses an amine group at its 3'
end to prevent filling in by the DNA polymerases during the first PCR amplification step and
generation of the AP1 binding site. This suppression PCR effect prevents exponential
amplification of molecules containing the adaptor at each end, and the adaptor primer binding
sites are only produced when a strand complementary to the upper strand of the adaptor is
synthesized by extension from a gene specific primer. The first PCR reaction is performed
using an adaptor primer AP1 and a gene specific primer. An aliquot of the first PCR product is
used as a template in a second PCR amplification using the nested gene specific primer and AP2.

In order to isolate the regulatory regions upstream of the B. napus FAE1 coding
sequence, genomic DNA was prepared from developing leaves and digested with 5 blunt-end
cutting restriction enzymes (DraI, EcoRV, HpaI, PvuII and ScaI) to generate a series of DNA
libraries. After ligation of adapter molecules, individual libraries were used as templates in a
two step PCR. In the first PCR amplification using the AP1 primer
5'-GGATCCTAATACGACTCACTATAGGGC-3' and the FAE1 gene specific primer
5'-AAAGAGTGGAGCGATGGTTATGAGG-3' (Bnwalk1), multiple DNA fragments were
amplified from all five library templates. After a second round of PCR, using the AP2 primer
5'-CTATAGGGCTCGAGCGGC-3' and the nested FAE1 specific primer
5'-CGGAAAGAAGCAAAGGTTGAAAAGG-3' (Bnwalk2), the longest single fragment of 1.6
kb was obtained from the HpaI library template. This fragment was inserted into the pCR2.1
plasmid (Invitrogen) and sequenced. The sequence is shown in Figure 2.
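The nesting of the two gene specific primers used for PCR walking can be verified against the 5' end of the B. napus coding sequence shown in Figure 2. The sketch below is a minimal illustration; the helper names `binding_start` and `is_nested` are assumptions of this example, and the template is the 190 nucleotides of coding sequence (positions 1 to 190) transcribed from Figure 2. Both primers are antisense, so their reverse complements are located on the strand shown.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def binding_start(template, antisense_primer):
    """0-based start, on the strand shown, of the site bound by an antisense
    primer, or -1 if the site is absent."""
    return template.upper().find(reverse_complement(antisense_primer))


def is_nested(template, outer, inner):
    """True if the inner primer site lies upstream of the outer primer site and
    within the product primed by the outer primer when walking upstream."""
    o = binding_start(template, outer)
    i = binding_start(template, inner)
    return o != -1 and i != -1 and i < o and i + len(inner) <= o + len(outer)


# B. napus FAE1 coding sequence, positions 1-190 (Figure 2), spaces removed.
BN_CODING_5PRIME = "".join("""
ATGACGTCCA TTAACGTAAA GCTCCTTTAC CATTACGTCA TAACCAACCT
TTTCAACCTT TGCTTCTTTC CGTTAACGGC GATCGTCGCC GGAAAAGCCT
ATCGGCTTAC CATAGACGAT CTTCACCACT TATACTATTC CTATCTCCAA
CACAACCTCA TAACCATCGC TCCACTCTTT GCCTTCACCG
""".split())

BNWALK1 = "AAAGAGTGGAGCGATGGTTATGAGG"  # outer primer, first PCR with AP1
BNWALK2 = "CGGAAAGAAGCAAAGGTTGAAAAGG"  # nested primer, second PCR with AP2

if __name__ == "__main__":
    print(binding_start(BN_CODING_5PRIME, BNWALK1))
    print(binding_start(BN_CODING_5PRIME, BNWALK2))
    print(is_nested(BN_CODING_5PRIME, BNWALK1, BNWALK2))
```

Placing the nested primer closer to the ATG than the outer primer means the second amplification can only proceed from genuine first-round products.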

For the PCR walking experiment to isolate the L. annua 5' regulatory region, in
addition to the standard AP1 and AP2 primers, the following FAE1 specific primers were used:
5'-GATCGTTTGTGGTAAGACGAGAGC-3' (Lawalk1) and
5'-GTCAGTGGGAAGAAACAGAGGTTG-3' (Lawalk2). In the first PCR reaction, the DraI,
EcoRV, PvuII, ScaI and SspI library templates were used. In a second PCR amplification the
longest single fragment, 1.1 kb in length, was synthesized using the EcoRV library template.
This fragment was inserted into the HincII site of the pT7T3-18U vector (Promega),
sequenced on both strands and analyzed (Figure 3).

Using the sequence data obtained for the 5' regulatory regions generated by PCR
walking, specific primers were generated for the amplification of the L. annua and B. napus
FAE1 promoter fragments. For the PCR amplification of the B. napus promoter fragment the
upstream primer was 5'-CTGACTTCACCAAAGAAACAACTCG-3' (BnproFW) in
combination with the downstream primer 5'-CGGAATTCCGTTTTTTTTTTTAGGCG-3'
(BnproRV). The synthesized fragment was ligated into the SmaI site of pGEM-7Zf
(Promega), then excised with XbaI/BamHI and cloned into the equivalent sites of the pBI101
binary vector (Clontech). The L. annua 5' regulatory region was amplified using the
5'-CAGCTTAACCGGTAAAATTGGCC-3' (LaproFW) upstream primer together with the
5'-TGTTCAGTTTTGTGTCGGAGAGG-3' (LaproRV) downstream primer and inserted in the
HincII site of the pT7T3-18U (Promega) plasmid. In order to clone the L. annua promoter
fragment into the pBI101 binary vector, an XbaI site was added by subcloning the PstI/KpnI
fragment released from the pT7T3-18U vector into pBluescript II KS+ (Stratagene). The
fragment was then excised and cloned in the XbaI site of the pBI101 vector.

The resulting vectors pBnFAE1-GUS and pLaFAE1-GUS in pBI101 were then
introduced into A. tumefaciens strain GV3101 by heat-shock, and used to transform
Arabidopsis as described above. Transformants were selected on agar-solidified medium
containing kanamycin (50 µg/ml). More than 100 transformants were generated for each
construct. The activity of the L. annua and B. napus FAE1 promoters was determined by GUS
expression assays on the developing seeds and also on non-reproductive plant tissues as
controls. Consistent seed-specific GUS expression was obtained for both promoter constructs
in independent transgenic lines. In contrast, there was no detectable GUS activity in leaf, stem
and silique samples.

WHAT IS CLAIMED IS:

1. A recombinant nucleic acid molecule comprising a heterologous promoter sequence
operably linked to a nucleic acid sequence, wherein the promoter sequence comprises
a transcriptional regulatory region capable of mediating seed-specific expression in
Arabidopsis wherein the transcriptional regulatory region:

(a) is obtainable from a 5' region of a plant FAE1 gene; or

(b) hybridizes under stringent conditions to the 5' region of the plant FAE1 gene;
or

(c) is at least 70% identical when optimally aligned to the 5' region of the plant
FAE1 gene.

2. The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene
comprises (5' to 3'):

AGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA.

3. The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene
comprises (5' to 3'):

AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC
CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA
AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA
AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC
CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA
AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT
CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC
CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA
ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT
AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT
GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA.

4. The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene
comprises (5' to 3'):

AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC
GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA
TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG
ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT
TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG
ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG
AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC
AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC
CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG
CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA
TGAGAACGGC AAAATCGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG
CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT
CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT
TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA
TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA
GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC
GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG
CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG
ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT
AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA
GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC
CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT
ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA
TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC
TCTTATTTAC ATGAAATTTT TAAACGCCTA AAAAAAAAAA CGGAATTCCG.

5. The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene
comprises (5' to 3'):

CAGCTTAAC
GGCCTGTACA
AGACTAATTT TAATTTACTG
GCAAAAATAG AAATCAATTT ATAAGTGTTT AAACAAATCG ATGGTATAAC
TGATTAGTGA TCACTCTTAG GTTTTGATCC AACTCGAGTA TTGAGTATTG
AACGCTTTTT TTAAATAAAA TCTTGATTTT TAAATTGGTT TTTTGAGTAA
AAAAGTTCTT AATATTTTCT CTTTGTTTTA ATGGGTTTGT TTTGCATTTT
ATAAGCTTAA TTTTTCTAAT TTAATATTTT ATCTATCATC GTCCGTAAAG
TTTTATTTGG CACAAACTTG TTTTACTTTT CTACCTTATA ATTTGGGAAC
TGGTTGAGTC AAAGCGTACC GGACAAATAT GTTTTATATT CTTATTTAAG
AATTAACACT CATCTCATAA TTAGTCAGAG GCTAGGGAGA TTCAGCCAAT
CAATGCTAAC AACAAAATTC TCTTAATGAT CTAACGATGC TATTTAATAT
TCGGATCAGT ATTCTTAAAT AAGAATATAA AACTAATTCA ATAGTTACAG
ATAAAAACTT ATATAGACTT TTTTATTTGG AATATAAAAG TATCAATATA
TTATAGACAA TATTTATAAC GTTAAAAATA CAATATTTAT ATTTTTTATA
TATTTATTTC AAATTGAAAA GCATTACTTC TATCGAAATG AATTTTAGTA
TATTAATTAA TATTTTTTTA ATCGGACTAC TTTCCTATTT TGGCACCTTT
CATCTGACTA CTAATTTATT TCAATGTGTA TGCATGCATG AGCATGAGTA
ATACACATGT CTATATAAAT GCATGTAAAA CGTAACGGAC CACAAAAGTG
GATCCATACA AATACATCTC ATCGCACCCT CTCCGACACA AAACTGAACA.

6. The recombinant nucleic acid of claim 1 wherein the promoter sequence is selected
from the group consisting of Arabidopsis thaliana, Lunaria annua and Brassica napus
FAE1 promoter sequences.

7. The recombinant nucleic acid of any one of claims 1 through 6, wherein the
transcriptional regulatory region is at least 70% identical when optimally aligned to
the 5' region of the plant FAE1 gene.



8. The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region
comprises (5' to 3'):

AGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA.



9. The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region
comprises (5' to 3'):

AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC
CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA
AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA
AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC
CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA
AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT
CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC
CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA
ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT
AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT
GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA.

10. The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region
comprises (5' to 3'):

CTGACTTC ACCAAAGAAA CAACTCGAGT CGTTATCCAT
CTCCTCATAA CCATCGCTCC ACTCTTTGCC TTCACCGTTT TCGGTTCGGT 
TCTCTACATC GCAACCCGGC CCAAACCGGT TTACCTCGTT GAGTACTCAT 
GCTACCTTCC ACCAACGCAT TGTAGATCAA GTATCTCCAA GGTCATGGAT 
ATCTTTTATC AAGTAAGAAA AGCTGATCCT TCTCGGAACG GCACGTGCGA 
TGACTCGTCG TGGCTTGACT TCTTGAGGAA GATTCAAGAA CGTTCAGGTC 
TAGGCGATGA AACTCACGGG CCCGAGGGGC TGCTTCAGGT CCCTCCCCGG 
AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC 
GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA 
TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG 
ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT 
TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG 
ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG 
AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC 
AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC 
CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG 
CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA
TGAGAACGGC AAAATCGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG
CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT 
CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT 
TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA
TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA 
GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC 
GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG 
CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG
ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT 
AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA 
GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC
CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT
ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA
TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC
TCTTATTTAC ATGAAATTTT TAAACGCCTA AAAAAAAAAA CGGAATTCCG.



11. The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region
comprises (5' to 3'):

CAGCTTAAC CGGTAAAATT
GGCCTGTACA TATATTTACC ACTGAGTAAA GACATCAGTT AATGATTTGT
TGTTACTCAA TTGGGCTAAG TGTATTATTA TATGTGTTGT ATATAATAAA
GGTAGAACGT AAATTTACTA AGAATGTGTT TTTCCAATGT GATTGCTCTT
TGGCCTCTTA GGTTTGAATC CTACTCGAGA AGACTAATTT TAATTTACTG
GCAAAAATAG AAATCAATTT ATAAGTGTTT AAACAAATCG ATGGTATAAC
TGATTAGTGA TCACTCTTAG GTTTTGATCC AACTCGAGTA TTGAGTATTG
AACGCTTTTT TTAAATAAAA TCTTGATTTT TAAATTGGTT TTTTGAGTAA
AAAAGTTCTT AATATTTTCT CTTTGTTTTA ATGGGTTTGT TTTGCATTTT
ATAAGCTTAA TTTTTCTAAT TTAATATTTT ATCTATCATC GTCCGTAAAG
TTTTATTTGG CACAAACTTG TTTTACTTTT CTACCTTATA ATTTGGGAAC
TGGTTGAGTC AAAGCGTACC GGACAAATAT GTTTTATATT CTTATTTAAG
AATTAACACT CATCTCATAA TTAGTCAGAG GCTAGGGAGA TTCAGCCAAT
CAATGCTAAC AACAAAATTC TCTTAATGAT CTAACGATGC TATTTAATAT
TCGGATCAGT ATTCTTAAAT AAGAATATAA AACTAATTCA ATAGTTACAG
ATAAAAACTT ATATAGACTT TTTTATTTGG AATATAAAAG TATCAATATA
TTATAGACAA TATTTATAAC GTTAAAAATA CAATATTTAT ATTTTTTATA
TATTTATTTC AAATTGAAAA GCATTACTTC TATCGAAATG AATTTTAGTA
TATTAATTAA TATTTTTTTA ATCGGACTAC TTTCCTATTT TGGCACCTTT
CATCTGACTA CTAATTTATT TCAATGTGTA TGCATGCATG AGCATGAGTA
ATACACATGT CTATATAAAT GCATGTAAAA CGTAACGGAC CACAAAAGTG
GATCCATACA AATACATCTC ATCGCACCCT CTCCGACACA AAACTGAACA.



12. The recombinant nucleic acid of any one of claims 1 through 11 wherein the nucleic
acid sequence encodes a translatable mRNA.

13. The recombinant nucleic acid of claim 12 wherein the nucleic acid sequence encodes
an enzyme involved in lipid metabolism.

14. The recombinant nucleic acid of any one of claims 1 through 13, further comprising a 
transcription termination region operably linked to the nucleic acid sequence. 

15. A host cell comprising the recombinant nucleic acid of any one of claims 1 through 14. 

16. The host cell of claim 15, wherein the host cell is of a dicotyledonous plant species. 

17. A plant comprising the recombinant nucleic acid of any one of claims 1 through 14. 

18. The plant of claim 17, wherein the plant is of a dicotyledonous plant species. 

19. A method of altering the phenotype of a seed comprising: 

a) transforming a seed-bearing plant, or a progenitor of the seed-bearing plant, 
with a vector comprising the nucleic acid of any one of claims 1 through 14; 

b) growing the seed-bearing plant to obtain seed under conditions wherein the 
nucleic acid sequence is expressed during embryogenesis under the control of 
the transcriptional regulatory region to alter the phenotype of the seed. 

20. A method of transforming a plant cell comprising transforming the plant cell with the 
recombinant nucleic acid of any one of claims 1 through 14. 






WO 01/11061 



PCT/CA00/00907 



Figure 1. Arabidopsis thaliana FAE1 promoter:
(Length: 934 bp)

-950 ACTCA TAAAAACTAG TAGATTGGTT GGTTGGTTTC CATGTACCAG

AtproFW

-900 AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC
-850 CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA
-800 AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA
-750 AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC
-700 CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA
-650 AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT
-600 CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC
-550 CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA
-500 ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT
-450 AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT
-400 GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA
-350 ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT
-300 TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT
-250 TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA
-200 TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA
-150 CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT
-100 GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT
-50 GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA

AtproRV

1 ATGACGTCCG TTAACGTTAA GCTCCTT




Figure 2. Brassica napus FAE1 promoter: 
(Length: 1588 bp) 

-1600 GGTTGGGCAA ATCTGACTTC ACCAAAGAAA CAACTCGAGT CGTTATCCAT

BnproFW 

-1550 CTCCTCATAA CCATCGCTCC ACTCTTTGCC TTCACCGTTT TCGGTTCGGT 
-1500 TCTCTACATC GCAACCCGGC CCAAACCGGT TTACCTCGTT GAGTACTCAT 
-1450 GCTACCTTCC ACCAACGCAT TGTAGATCAA GTATCTCCAA GGTCATGGAT
-1400 ATCTTTTATC AAGTAAGAAA AGCTGATCCT TCTCGGAACG GCACGTGCGA
-1350 TGACTCGTCG TGGCTTGACT TCTTGAGGAA GATTCAAGAA CGTTCAGGTC 
-1300 TAGGCGATGA AACTCACGGG CCCGAGGGGC TGCTTCAGGT CCCTCCCCGG 
-1250 AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC 
-1200 GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA 
-1150 TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG 
-1100 ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT 
-1050 TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG 
-1000 ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG 
-950 AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC 
-900 AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC 
-850 CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG
-800 CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA
-750 TGAGAACGGC AAAATCGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG
-700 CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT 
-650 CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT 
-600 TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA 
-550 TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA 
-500 GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC 

Figure 2 Continued. Brassica napus FAE1 promoter:

-450 GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG
-400 CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG
-350 ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT
-300 AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA
-250 GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC
-200 CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT
-150 ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA
-100 TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC
-50 TCTTATTTAC ATGAAATTTT TAAACGCCTA AAAAAAAAAA CGGAATTCCG

<- BnproRV

1 ATGACGTCCA TTAACGTAAA GCTCCTTTAC CATTACGTCA TAACCAACCT
51 TTTCAACCTT TGCTTCTTTC CGTTAACGGC GATCGTCGCC GGAAAAGCCT

<- Bnwalk2

101 ATCGGCTTAC CATAGACGAT CTTCACCACT TATACTATTC CTATCTCCAA
151 CACAACCTCA TAACCATCGC TCCACTCTTT GCCTTCACCG

<- Bnwalk1

Figure 3. Lunaria annua FAE1 promoter:
(Length: 1069 bp)

-1100 CG CCGGGGAGTT TCAGCTTAAC CGGTAAAATT

LaproFW

-1050 GGCCTGTACA TATATTTACC ACTGAGTAAA GACATCAGTT AATGATTTGT
-1000 TGTTACTCAA TTGGGCTAAG TGTATTATTA TATGTGTTGT ATATAATAAA
-950 GGTAGAACGT AAATTTACTA AGAATGTGTT TTTCCAATGT GATTGCTCTT
-900 TGGCCTCTTA GGTTTGAATC CTACTCGAGA AGACTAATTT TAATTTACTG
-850 GCAAAAATAG AAATCAATTT ATAAGTGTTT AAACAAATCG ATGGTATAAC
-800 TGATTAGTGA TCACTCTTAG GTTTTGATCC AACTCGAGTA TTGAGTATTG
-750 AACGCTTTTT TTAAATAAAA TCTTGATTTT TAAATTGGTT TTTTGAGTAA
-700 AAAAGTTCTT AATATTTTCT CTTTGTTTTA ATGGGTTTGT TTTGCATTTT
-650 ATAAGCTTAA TTTTTCTAAT TTAATATTTT ATCTATCATC GTCCGTAAAG
-600 TTTTATTTGG CACAAACTTG TTTTACTTTT CTACCTTATA ATTTGGGAAC
-550 TGGTTGAGTC AAAGCGTACC GGACAAATAT GTTTTATATT CTTATTTAAG
-500 AATTAACACT CATCTCATAA TTAGTCAGAG GCTAGGGAGA TTCAGCCAAT
-450 CAATGCTAAC AACAAAATTC TCTTAATGAT CTAACGATGC TATTTAATAT
-400 TCGGATCAGT ATTCTTAAAT AAGAATATAA AACTAATTCA ATAGTTACAG
-350 ATAAAAACTT ATATAGACTT TTTTATTTGG AATATAAAAG TATCAATATA
-300 TTATAGACAA TATTTATAAC GTTAAAAATA CAATATTTAT ATTTTTTATA
-250 TATTTATTTC AAATTGAAAA GCATTACTTC TATCGAAATG AATTTTAGTA
-200 TATTAATTAA TATTTTTTTA ATCGGACTAC TTTCCTATTT TGGCACCTTT
-150 CATCTGACTA CTAATTTATT TCAATGTGTA TGCATGCATG AGCATGAGTA

Figure 3 Continued. Lunaria annua FAE1 promoter:

-100 ATACACATGT CTATATAAAT GCATGTAAAA CGTAACGGAC CACAAAAGTG
-50 GATCCATACA AATACATCTC ATCGCACCCT CTCCGACACA AAACTGAACA

LaproRV

1 ATGACGTCTG TGAACGTAAA ACTCCTTTAC CATTACGTCA TAACCAACTT
51 TTTCAACCTC TGTTTCTTCC CACTGACGGG GATCCTCGCC GGAAAAGGCT

Lawalk2

101 CTCGTCTTAC CACAAACGAT CTCCACCA

<- Lawalk1

Figure 4: Alignment of A.t., L.a. and B.n. FAE1 promoters

CLUSTAL W (1.74) multiple sequence alignment 



A. t. 

L.a. 

B. n. GGTTGGGCAAATCTGACTTCACCAAAGAAACAACTCGAGTCGTTATCCATCTCCTCATAA 

A. t. 

L.a.

B. n. CCATCGCTCCACTCTTTGCCTTCACCGTTTTCGGTTCGGTTCTCTACATCGCAACCCGGC 

A. t. 

L.a. 

B. n. CCAAACCGGTTTACCTCGTTGAGTACTCATGCTACCTTCCACCAACGCATTGTAGATCAA 

A. t. 

L.a. 

B. n. GTATCTCCAAGGTCATGGATATCTTTTATCAAGTAAGAAAAGCTGATCCTTCTCGGAACG 

A. t. 

L.a. 

B. n. GCACGTGCGATGACTCGTCGTGGCTTGACTTCTTGAGGAAGATTCAAGAACGTTCAGGTC 

A.t. 

L.a.

B. n. TAGGCGATGAAACTCACGGGCCCGAGGGGCTGCTTCAGGTCCCTCCCCGGAAGACTTTTG

A. t. 

L.a. 

B. n. CGGCGGCGCGTGAAGAGACGGAGCAAGTTATCATTGGTGCGCTAGAAAATCTATTCAAGA 

A. t. 

L.a. 

B. n. ACACCAACGTTAACCCTAAAGATATAGGTATACTTGTGGTGAACTCAAGCATGTTTAATC 

A. t. 

L.a. CGCCGGGGAGTTTCAGCTTAA 

B. n. CAACTCCATCGCTCTCCGCGATGGTCGTTAACACTTTCAAGCTCCGAAGCAACGTAAGAA 

Con. 4 NNNNNNNNNNNNNNNNNNNNN 

A. t. ACTCATAAAAACTAGTAGAT--TGGTTGG 

L.a. CCGGTAAAATTGGCCTGTACATATATTTACCACTGAGTAAAGACATCAGTTAATGATTTG 

B. n. GCTTTAACCTTGGTGGCATGGGTTGTAGTGCCGGCGTTATAGCCATTGATCTAGCAAAGG 

* *** ★★★★ * * * * * 

Con. 4 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMSKSRKWTWARMYCKYRRWYNNKSRWWKG 
Figure 4 Continued: Alignment of A.t., L.a. and B.n. FAE1 promoters 



A. t. TTGGTTTCCA--TGTACCAGAAGGCTTACCCTAT-TAGTTGAAAGTTGAAACTTTGTTCC 
L.a. TTGTTACTCAATTGGGCTAAGTGTATTATTATAT-GTGTTGTATATAATAAAGGTAGAAC 

B. n. ACTTGTTGCATGTCCATAAAAATACGTATGCTCTTGTGGTGAGCACAGAGAACATCACTT 

Con . 4 WYKKKWYBCANNTSBRYHARRWKDMKTAYBMTMTNKWGKTGWRHRYWRWRAMBDTVDHHY 



A. t . CTACTCAATTCCTAGTTGTGTAAATGT ATGTATATGTAAT GCGTATAAAACGTA 

L.a. GTAA--ATTTACTAAGAATGTGTTTTTCCAATGTGATTGCTCTTTGGCCTCTTAGGTTTG 

B. n. ATAA-CATTTACGCTGGTGATAATAGGTCCATGATGGTTTCAAATTGCTTGTTCCGTGTT 

Con . 4 VTAMNNAWTTMCMMDKDDKRTRWWWKKNNNATGWDDDTKYHMWNNNGCBTVTWMVRYKTD 

A. t . GTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGAGATGGGAG-CAGAACTAAAG 
L.a. AATCCTACTCGAGAAG-ACTAATTTTAATTTACTGGCAAAAATAGAAA-TCAATTTATAA 

B. n. GGTGGGGC-CGCTATTTTGCTCTCCAACAAGCCTGGAGATCGTAGACGGTCCAAGTACGA 

★ * * * * * + ★ 

Con. 4 RDWSBKRMNYGMBWWKNWSYDVTYYWWVWDDMCKRKVRRWVRTRGRMRNYMVAWBTAHRR 

A. t. AT--GATGACATAATTA AGAACGAATTTGA-AAGG-CTCTTAGGTTTGAATCCT 

L.a. GT--GTTTAAACAAATCGATGGTATAACTGATTAGT-GATCACTCTTAGGTTTTGATCCA 

B. n. GCTAGTTCACACGGTTCGAACGCATACCGGAGCTGACGACAAGTCTTTTCGTTGCGTGCA 

Con. 4 RYNNGWTBAMAYRRWTMNNNNNNAKAMCKRAKYWGWNRABVNSTCTTWKSKTTKVRTSCW 



A. t . ATTCGAGAATGTTTTTGTCAAAGATAGTGGCGATTTTGAACCAAAGAAAACATTTAAA-A 
L.a. ACTCGAGTATTGAGTATTGAACGCTT TTTTTAAATAAAATCTTGATTTTTA-A 

B. n. A--CAAGGA-GACGATGAGAACGGCAAAATCGGAGTGAGTTTGTCCAAGGACATA 

★ *★ +* ★* ** ★ + * + * ★ + _* 

Con . 4 ANNCRAGDANKDHKWWKWSAAMGVYWNNNNNNNWTYKKARHBARWDWVWHSAWKKWHANA 



A. t . AATCAGTATCCGGTTAC GTTCATGCAAATAGAAAGTGGTCTA GGATCTGATT- 

L . a . ATTGGTTTTTTGAGTAAAAAAGTTCTTAATATTTTCTCTTTGTTTTAATGGGTTTGTTT- 

B. n. ACCGATGTTGCTGGTCGAACGGTTAAGAAAAACATAGCAACGTT GGGTCCGTTGA 

* * *■ * ★ + ★ ★ ★ + * 

Con . 4 AHYSRKKWTBYKRKTMVNNNNGTTMWKRMWAWYWKMDMDWBGTYNNNNNGGRTYYGWTKN 



A. t. GTAATTTTAGA--CTTAAAGAGTCTC--TTAAGATTCAATCCTGGCT-GTGTACAAAACT 
L.a. TGCATTTTATAAGCTTAATTTTTCTAATTTAATATTTTATCTATCATCGTCCGTAAAGTT 

B. n. TTCTTCCGTTAAGCGAGAAACTTCTT--TTTTTCGTTA--CCTTCATGGGCAAGAAACTT 

Con. 4 KKMWTYYKWKANNCKWRAWDHKTCTHNNTTWWKMKTYWNNCYWKSMTNGKSHRBAAAVYT 

A. t. ACAAATAATATA TTTTAGACTATTTGGCCTTAACTAAACTTCCA-CTCATTATTTA 

L.a. TTATTTGGCACAAACTTGTTTTACTTTTCTACCTTA--TAATTTGGGAACTGGTTGAGT- 

B. n. TTCAAAGATAAAATCAAACATTACTACGTCCCGGATTTCAAACTTGCTATTGACCATTTT 

Con . 4 WYMWWWRRYAHANNNNWDYWWKACTWYKYBVCSKWWNNYAAWYTKSSWNYTSRYYRWKTN 

A. t. -CTGAGGTTAGAGAA--TAGACTTGCGAATAAACACATTCCCGAGAAATACTCATGATCC 
L.a. -CAAAGCGTACCGGA--CAAATATGTTT-TATATTCTTATTTAAGAATTAACACTCATCT 

B. n. TGTATACATGCCGGAGGCAGAGCCGTGATTGATGTGCTAGAGAAGAACCTAGCCCTAGCA 

****** ★ * **** + * 

Con. 4 NSWRWRSDTRSMGRANNYARABHYGYKWNTRWWBWSHTWBHBRAGAAHYWMBMMYBAKCH 
Figure 4 Continued: Alignment of A.t., L.a. and B.n. FAE1 promoters 

CE3 

A. t. CATAATTAGTCAGAGGGTATG CCAATCAGATCTAAGAAC[ACACATTCCCTC] 

L.a. CATAATTAGTCAGAGGCTAGGGAGATTCAGCCAATCAATGCTAACAACAAA-ATTCTCTT 

B. n. CCGATCGATGTAGAGGC ATCAAGATCAACGTTACATAGATTTGG 

* * * ***** ** * ******* ** * + * 

Con. 4 CMKAWYKAKKYAGAGGSNNNNNNNNNNNNNNNNATCARDDYYAASRWYAMANAKWYYYKB 

A. t. AA--ATTTTA--ATGCAC-ATGTAATCAT AGTTT AGCACAATTCAAAA 

L.a. AATGATCTAACGATGCT--ATTTAATATTCGGATCAGTATTCTTAAATAAGAATATAAAA 

B. n. AAACACTTCATCTAGCTCAATATGGTATGAGTTGGCATACAT-AGAAG-CAAAA 

** * * * ** * ★* * ★ *** * ** * * ** **** 

Con . 4 AANNAYYTHANNWWGCWNNATDTRRTMWKNNNNNNAGTWKNNNNNNAKNASAAKNYAAAA 

A. t. ATAATGTAGTA-TTAAAGACAGAAATTTGTA--GACTTTTTT TTGGCGT-TAAAGG 

L.a. CTAATTCAATAGTTACAGATAAAAACTTATATAGACTTTTTTAT--TTGGAATATAAAAG 

B. n. GGAAGGATGAA-GAAAGGTAATAAAGTTTGGCAGATTGCTTTAGGGTCAGGCTTTAAGTG 

** * + * + *★★ ** *** ★ **** ★ * ★ *** ik- 

Con . 4 VKAAKKHWRWANKWAMRGWHADAAABTTDKRNNGAYTKYTTTNNNNTYRGVVTNTAARDG 

A. t. AA GACTAAGTTTATA-CGT ACATTT-TATTTTAAGT 

L.a. TATCAATATATTATA-GACAATATTTATAACGTTAAAAATACAATATTTATATTTTTTAT 

B. n. TAACAGTGCAGTTTGGGTGGCTCTAAACAATGTCAAAGCTTCGACAAATAGTCCTTGGGA 

★ *★ * * ★ + * * * ** ** *** * * * * *★ ★ * 

Con. 4 WANNNNNNNNNNNNNNGWSDMWVTWWAYANYGTNNNNNNNNNNAYAWWTNKWYYTTDDRW 

CE1 

A. t. GGA AAACCGAAATT--[TTCCATCGA]AATATATGA--ATTT-AGTATAT 

L.a. ATATTTATTTCAAATTGAAAAGCATTACTTCTATCGAAATGA--ATTTTAGTATATTAAT 

B. n. ACACT GCATCGACAGATACCCGGTCAAAATTGATTCTGATTCAGGTAAGTCAGA 

* * ***** ** * ** *** *★* * ★ 

Con . 4 RBAYTNNNNNNRMAYYGAYADDYAYYMSDTCDAWMKWDATKMNNATTYNRGTAWRTNNNN 

G-box2 

A. t. --ATATTTCTGCAAT GTACTATTTTGCTATTTTGG[CAA-CTT]TCAGTGGACTAC 

L.a. TAATATTTTTTTAATC GGACTACTTTCCTATTTTGGCAC-CTTTCATCTGACTAC 

B. n. GACTCGTGTCCAAAACGGTCGGTCCTAATAAACGATGTTTGCTCTCTTTCGTTT--CTTT 

* * * + * * * *** + ★ ** ** ** ***** » * + * 

Con . 4 NNMTMKTKYYBHAAWNNNNNNGKMCTAHTWWVCKATKTTKGCWMNCTTTCRKYKNNCTWY 

G-box1 

A. t. TACTTTATTACAATGTGT--ATGGATGC-ATGAG TTTGAGTA-TA[CACATG]TCTAAA 

L.a. TAATTTATTTCAATGTGT--ATGCATGC-ATGAG CATGAGTAATACACATGTCTATA 

B. n. TTATTTGTTATAATAATTTGATGGCTACGATGTTTCTCTTGTTTGTTATGAATAAAGAAT 

* *** ** *** * *** * * *** ** * w+ * 

Con . 4 TWMTTTRTTWYAATRWKTNNATGSMTRCNATGWKNNNYWTGWKTRWTAYRMATRWMKAWW 

A-300 EMI ABA 

A. t. TGCATGCT-TTGCAAAACGTAACGGACC-ACAAAAGAGGATCCA[TGCAAATACATCTCAT] 
L.a. TAAATGCA-T-GTAAAACGTAACGGACC-ACAAAAGTGGATCCATACAAATACATCTCAT 

B. n. GCAATGGTGTTCTAGTATTTGATTGTTTTACATGTATGTATCTCTT-ATTTACATGAAAT 

*** * ***** *** ★ *** * * ***** ** 

Con. 4 KVMATGSWNTNSYARWAYKTRAYKGWYYNACAWRWRWGKATCYMTDNAWWTACATSWMAT 

A. t. AGC-TTCCTCCATTATTTTCCGACACAAA-CAGAGCA 

L.a. CGC-ACCCTC TCCGACACAAAACTGAACA 

B. n. TTTTAAACGCC TAAAAAAAAAAACGGAATTCCG 

** * *****★★■*. 

Con . 4 HKYNWHMCKCNNNNNNNNTMMRAMAMAAANCDGARYWNNN 
Figure 5: Alignment of A.t. and L.a. FAE1 promoters 

CLUSTAL W (1.74) multiple sequence alignment 



A.t. ACTCATAA 

L.a. CGCCGGGGAGTTTCAGCTTAACCGGTAAAATTGGCCTGTACATATATTTACCACTGAGTA 

★ * * * ★ 

Con. 5 ACTSAKWA 

A. t. AAACTAGTAGAT--TGGTTGGTTGGTTTCCA--TGTACCAGAAGGCTTACCCTATTAGTT 

L.a. AAGACATCAGTTAATGATTTGTTGTTACTCAATTGGGCTAAGTGTATTATTATATGTGTT 

** * ** * ** ** **** ■*- ** ★ ★ * ★ ★ *** ★ * * *** 

Con . 5 AARMYAKYAGWTNNTGRTTKGTTGKTWYYCANNTGKRCYARRWGKMTTAYYMTATKWGTT 

A.t. GAAAGTTGAAACTTTGTTCCCTACTCAATTCCTAGTTGTGTAAATGT ATGTATATGT 

L.a. GTATATAATAAAGGTAGAACGTAA--ATTTACTAAGAATGTGTTTTTCCAATGTGATTGC 

★ + * * * ★ * ** * *★ ★ *•*■ *** ★ * **** -«•-*■ 

Con. 5 GWAWRTWRWAAMKKTRKWMCSTAMNNAWTTMCTARKWRTGTRWWTKTNNNATGTRWWTGY 

A.t. AAT GCGTATAAAACGTAGTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGA 

L.a. TCTTTGGCCTCTTAGGTTTGAATCCTACTCGAGAAG-ACTAATTTTAATTTACTGGCAAA 

Con. 5 WMTNNNGCSTMTWARRYKTRRWWCYTAMWYGASWAGNASTRRTTYTWRWKWMCKRKSARA 

A.t. GATGGGAGCAGAACTAAAGATGATGACATAATTA AGAACGAATTTGAAAGG-CT 

L.a. AATAGAAATCAATTTATAAGTGTTTAAACAAATCGATGGTATAACTGATTAGTGATCACT 
+ * * * ★ ** * ** ★ ★ * *★ ★ * *** *** ★ ★ ■*■ + 

Con . 5 RATRGRARYMRAWYTAWARRTGWTKAMAYAAWTMNNNNNNAKAACKRATTWGWRAKSNCT 

A.t. CTTAGGTTTGAATCCTATTCGAGAATGTTTTTGTCAAAGATAGTGGCGATTTTGAACCAA 
L.a. CTTAGGTTTTGATCCAACTCGAGTATTGAGTATTGAACGCTT TTTTTAAATAA 

★**★****★ **** ★ *★ * * •*• * * * *<*-** ** ** 

Con . 5 CTTAGGTTTKRATCCWAYTCGAGWATKKWKTWKTSAAMGMTWNNNNNNNTTTTKAAMYAA 

A.t. AGAAAACATTTAAAAAATCAGTATCCGGTTAC GTTCATGCAAATAGAAAGTGGTCT 

L.a. AATCTTGATTTTTAAATTGGTTTTTTGAGTAAAAAAGTTCTTAATATTTTCTCTTTGTTT 

★ ★ *** * + * ★ ★ ★ *★** *•*•■*■ ★ ★* * 

Con . 5 ARWMWWSATTTWWAAAWTSRKTWTYYGRKTAMNNNNGTTCWTRMWAWTWKMWMKTKGTTT 

A.t. A GGATCTGATTGTAATTTTAGA--CTTAAAGAGTCTC--TTAAGATTCAATCCT GGC 

L.a. TAATGGGTTTGTTTTGCATTTTATAAGCTTAATTTTTCTAATTTAATATTTTATCTATCA 

Con . 5 WNNNGGRTYTGWTTKKMATTTTAKANNCTTAAWKWKTCTMNNTTAAKATTYWATCYWKSM 

A.t. T-GTGTACAAAACTACAAATAATATA TTTTAGACTATTTGGCCTTAACTAAACTTC 

L.a. TCGTCCGTAAAGTTTTATTTGGCACAAACTTGTTTTACTTTTCTACCTTA--TAATTTGG 

★ ** * + * + + * ★ ★ * ★ ★ * * * * * * -*• + 

Con. 5 TNGTSYRYAAARYTWYAWWTRRYAYANNNNTKTTWKACTWTTYKRCCTTANNTAAWYTKS 

A.t. CA-CTCATTATTTACTGAGGTTAGAGAATAGACTTGCGAATAAACACATTCCCGAGAAAT 
L.a. GAACTGGTTGAGT-CAAAGCGTACCGGACAAATATGTTT-TATATTCTTATTTAAGAATT 
* ** ** * * * + ** * + * ★ ** *★ ★ * ★ **** * 

Con . 5 SANCTSRTTRWKTNCWRAGSKTASMGRAYARAYWTGYKWNTAWAYWCWTWYYYRAGAAWT 

-400 

A. t. -432 ACTCATGATCCCATAATTAGTCAGAGGGTATG [C]CAATCAGATCTAAGAACA 

L.a. AACACTCATCTCATAATTAGTCAGAGGCTAGGGAGATTCAGCCAATCAATGCTAACAACA 

★ * *** **************** ★ * * * * * + * ★ + *** * * ★ * 

Con . 5 AMYMMTSATCYCATAATTAGTCAGAGGSTAKGNNNNNNNNNCCAATCARWKCTAASAACA 
Figure 5 Continued: Alignment of A.t. and L.a. FAE1 promoters 

A. t. -381 CACATTCCCTCAA--ATTTTA--ATGCACATGTAATCAT AGTTT AGCA 

L.a . AA-ATTCTCTTAATGATCTAACGATGCT-ATTTAATATTCGGATCAGTATTCTTAAATAA 

Con . 5 MANATTCYCTYAANNATYTWANNATGCWNATKTAATMWTNNNNNNAGTWTNNNNNNAKMA 

A. t. -337 CAATTCAAAAATAATGTAGTA-TTAAAGACAGAAATTTGTA--GACTTTTTT--TTGGCG 

L.a. GAATATAAAACTAATTCAATAGTTACAGATAAAAACTTATATAGACTTTTTTATTTGGAA 

Con . 5 SAATWYAAAAMTAATKYARTANTTAMAGAYARAAAYTTRTANNGACTTTTTTNNTTGGMR 

A. t. -282 T-TAAAGGAA GACTAAGTTTATA-CGT ACATTT-TAT 

L.a. TATAAAAGTATCAATATATTATAGACAATATTTATAACGTTAAAAATACAATATTTATAT 
★ ★★★★ ★ * * + ★ *★* 

Con . 5 TNTAAARGWANNNNNNNNNNNNNGACWAWRTTTATANCGTNNNNNNNNNNAYATTTNTAT 

A. t. -247 TTTAAGTGGA AAACCGAAATT--TTCCATCGAAATATATGAATTT-AGTATA 

L.a. TTTTTATATATTTATTTCAAATTGAAAAGCATTACTTCTATCGAAATGAATTTTAGTATA 

Con . 5 TTTWWRTRKANNNNNNNNAAAYYGAAAWKNNTTMCWTCKAWMKAWATGAATTTNAGTATA 

A. t. -198 T ATATTTCTGCAAT-GTACTATTTTGCTATTTTGGCAACTTTCAGTGGACTACT 

L.a. TTAATTAATATTTTTTTAATCGGACTACTTTCCTATTTTGGCACCTTTCATCTGACTACT 

Con . 5 TNNNNNNATATTT YTKYAATNGKACTAYTTTSCTATTTTGGCAMCTTTCAKYKGACTACT 

A.t. -145 ACTTTATTACAATGTGTATGGATGCATGAGTTTGAGTA-TACACATGTCTAAATGCATGC 

L.a. AATTTATTTCAATGTGTATGCATGCATGAGCATGAGTAATACACATGTCTATATAAATGC 

Con . 5 AMTTTATTWCAATGTGTATGSATGCATGAGYWTGAGTANTACACATGTCTAWATRMATGC 

A.t. -86 TTTGCAAAACGTAACGGACCACAAAAGAGGATCCATGCAAATACATCTCATAGCTTCCTC 

L.a. AT-GTAAAACGTAACGGACCACAAAAGTGGATCCATACAAATACATCTCATCGCACCCTC 

Con . 5 WTNGYAAAACGTAACGGACCACAAAAGWGGATCCATRCAAATACATCTCATMGCWYCCTC 

A.t. -26 CATTATTTTCCGACACAAA-CAGAGCA 

L.a. TCCGACACAAAACTGAACA 

Con. 5 NNNNNNNNTCCGACACAAANCWGARCA 
A: FAE1 promoter-castor hydroxylase 
B: Napin promoter-castor hydroxylase 
Figure 7: Alignment of B.n. and L.a. FAE1 promoters 
CLUSTAL W (1.81) multiple sequence alignment 



BnFAE1 GGTTGGGCAAATCTGACTTCACCAAAGAAACAACTCGAGTCGTTATCCATCTCCTCATAA 60 
LaFAE1 

BnFAE1 CCATCGCTCCACTCTTTGCCTTCACCGTTTTCGGTTCGGTTCTCTACATCGCAACCCGGC 120 
LaFAE1 

BnFAE1 CCAAACCGGTTTACCTCGTTGAGTACTCATGCTACCTTCCACCAACGCATTGTAGATCAA 180 
LaFAE1 CGCCGGGGAGT -TTCAGCTTAACCGGTAAAATTGGCCTGTACATATA 46 

BnFAE1 GTATCTCCAAGGTCATGGATATCTTTTATCAAGTAAGAAAAGCTGATCCTTCTCGGAACG 240 
LaFAE1 TTTACCACTGAGT-AAAGACATCAGTTAATGATTT GTTGTTACTCAATTGGGCT 99 

BnFAE1 GCACGTGCGATGACTCGTCGTGGCTTGACTTCTTGAGGAAGATTCAAGAACGTTCAGGTC 300 
LaFAE1 AAGTGTATTATTATATGTGTTG TATATAATAAAGGT AGAACGT--AAATT 147 

BnFAE1 TAGGCGATGAAACTCACGGGCCCGAGGGGCTGCTTCAGGTCCCTCCCCGGAAGACTTTTG 360 
LaFAE1 TA CTAAGAATGTGTTTTTCCAATGTGATTGCTCTTTGGCCTCTTAGGTTTGAATCCTA 205 

BnFAE1 CGGCGGCGCGTGAAGAGACGGAGCAAGTTATCATTGGTGCGCTAGAAAATCTATTCAAGA 420 
LaFAE1 CT CGAGAAGACTAATTTTAAT-TTACTGGCAAAAATAGAAATCAATTTATAA 256 

BnFAE1 ACACCAACGTTAACCCTAAAGATATAGGTATACTTGTGGTGAACTCAAGCATGTTTAATC 480 
LaFAE1 GTGTTTAAACAAATC--GATGGTATAACTG-ATTAGTGATCACTCTTAGGTT--TTGATC 311 

BnFAE1 CAACTCCATCGCTCTCCGCGATGGTCGTTAACACTTTCAAGCTCCGAAGCAACGTAAGAA 540 
LaFAE1 CAACTCGAGTATTG AGTATTGAACGCTTT TTTTAAATAAAATCTTGA 358 

BnFAE1 GCTTTAACCTTGGTGGCATGGGTTGTAGTGCCGGCGTTATAGCCATTGATCTAGCAAAGG 600 
LaFAE1 TTTTTAAA-TTGGTTTTTTGAGTAAAAAAGTTCTTAATATTTTCTCTT-TGTTTTAATGG 416 

BnFAE1 ACTTGTTGCATGTCC-ATAAAAATACGTATGCTCTTGTGGTGAGCACAGAGAACATCACT 659 
LaFAE1 GTTTGTTTTGCATTTTATAAGCTTAATTTTTCTAATTTAAT-ATTTTATCTATCATCGTC 475 

BnFAE1 TATAACATTTACGCTGGTGATAATAGGTCCATGATGGTTTCAAATTGCTTGTTCCGTGTT 719 
LaFAE1 CGTAAAGTTT TATTTGGCACAAACTTGTTTTA CTTTTCTACCTTATA 522 

BnFAE1 GGTGGGGCCGCTATTTTGCTCTCCAACAAGCCTGGAGATCGTAGACGGTCCAAGTACGAG 779 
LaFAE1 ATTTGGGA-ACTGGTTGAGTCA AAGCGT ACCGGACAAATATGTTTTATATTC 573 

BnFAE1 CTAGTTCACACGGTTCGAACGCATACCGGAGCTGACGACAAGTCTTTTCGTTGCGTGCAA 839 
LaFAE1 -TTATTTA-AGAATTAACACTCATCTCATAATTAGTCAGAGGC TAGGGAGATT 624 

BnFAE1 CAAGGAGACGATGAGAACGGCAAAATCGGAGTGAGTTTGTCCAAGGACATAACCGATGTT 899 
LaFAE1 CAGCCAATCAATGCTAACAACAAAATTCTCTTAA--TGATCTAACGATGCTATTTAATAT 682 
Figure 7 Continued: Alignment of B.n. and L.a. FAE1 promoters 



BnFAE1 GCTGGTCGAACGGTTAAGAAAAACATAGCAACGTTGGGTCCGTTGATTCTTCCGTTA-AG 958 
LaFAE1 TCGGATCAGTATTCTTAAATAAGAATATAAA ACTAATTCAATAGTTACAG 732 

BnFAE1 CGAGAAACTTCTTTTTTTCGTTACCTTCATGGGCAAGAAACTTTTCAAAGATAAAATCAA 1018 
LaFAE1 ATAAAAACTTATATAGACTTTTTTATTTG-GAATATAAAAGTATCAATATATTATAGACA 791 

BnFAE1 ACATTACTACGTCCCGGATTTCAAACTTGCTATTGACCATTTTTGTATACATGCCGGAGG 1078 
LaFAE1 ATATTTATA ACGTTAAAAATACAATATTTATATTTTTTATATATTTATTTCAAA 845 

BnFAE1 CAGAGCCGTGATTGATGTGCTAGAGAAGAACCTAGCCCTAGCACCGATCGATGTAGAGGC 1138 
LaFAE1 TTGAAAAGCATTACTTCTATCGAAATGAATTTTAGTATATTAATTAATATTTTTTT 901 

BnFAE1 ATCAAGATCAACGTTACATAGATTTGGAAACACTTCATCTAGCTCAATATGGTATGAGTT 1198 
LaFAE1 AATCGGACTACTTTCCTAT TTTGGCACCTTTCATCTGACT ACT 944 

BnFAE1 GGCATACATAGAAGCAAAAGGAAGGATGAAGAAAGGTAATAAAGTTTGGCAGATTGCTTT 1258 
LaFAE1 AATTTATTTCAATGTGTATGCATGCATGAGCATGAGTAATACACATGTCTAT 996 

BnFAE1 AGGGTCAGGCTTTAAGTGTAACAGTGCAGTTTGGGTGGCTCTAAACAATGTCAAAGCTTC 1318 
LaFAE1 ATAAATGCATGTAAAACGTAACGG-ACCACAAAAGTGGATCCATACAAATACATCTCATC 1055 

BnFAE1 GACAAATAGTCCTTGGGAACACTGCATCGACAGATACCCGGTCAAAATTGATTCTGATTC 1378 
LaFAE1 G-CACCCTCTCCGACACAAAACTGAACA 1082 

BnFAE1 AGGTAAGTCAGAGACTCGTGTCCAAAACGGTCGGTCCTAATAAACGATGTTTGCTCTCTT 1438 
LaFAE1 

BnFAE1 TCGTTTCTTTTTATTTGTTATAATAATTTGATGGCTACGATGTTTCTCTTGTTTGTTATG 1498 
LaFAE1 

BnFAE1 AATAAAGAATGCAATGGTGTTCTAGTATTTGATTGTTTTACATGTATGTATCTCTTATTT 1558 
LaFAE1 

BnFAE1 ACATGAAATTTTTAAACGCCTAAAAAAAAAAACGGAATTCCG 1600 
LaFAE1 
Figure 8: Alignment of B.n. and A.t. FAE1 promoters 

CLUSTAL W (1.81) multiple sequence alignment 



AtFAE1 
BnFAE1 GGTTGGGCAAATCTGACTTCACCAAAGAAACAACTCGAGTCGTTATCCATCTCCTCATAA 60 

AtFAE1 
BnFAE1 CCATCGCTCCACTCTTTGCCTTCACCGTTTTCGGTTCGGTTCTCTACATCGCAACCCGGC 120 

AtFAE1 
BnFAE1 CCAAACCGGTTTACCTCGTTGAGTACTCATGCTACCTTCCACCAACGCATTGTAGATCAA 180 

AtFAE1 ACTCATAAAA 10 
BnFAE1 GTATCTCCAAGGTCATGGATATCTTTTATCAAGTAAGAAAAGCTGATCCTTCTCGGAACG 240 

AtFAE1 ACTAGTAGATTGGTTGGT--TGGTTTCCATGTACCAGAAGGCTTACCCTATTAGT 63 
BnFAE1 GCACGTGCGATGACTCGTCGTGGCTTGACTTCTTGAGGAAGATTCAAGAACGTTCAGGTC 300 

AtFAE1 TGAAAGTTGAAACTT-TGTTCCCTACT--CAATTCCTAGTTGTGTAAATGTATGTATATG 120 
BnFAE1 TAGGCGATGAAACTCACGGGCCCGAGGGGCTGCTTCAGGTCCCTCCCCGGAAGACTTTTG 360 

AtFAE1 TAATG-CGTATAAAACGTAGTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGAG 179 
BnFAE1 CGGCGGCGCGTGAAGAGACGGAGC-AAGTTATCATTGGTGCGCTAGAAAATCTATTCAAG 419 

AtFAE1 A TGGGAGCAGAACTAAAGATGATGACATAATTAAGAACGAATTTGAAAGGCTCTTA 235 
BnFAE1 AACACCAACGTTAACCCTAAAGATATAGGTATACTTGTGG-TGAACTCAAGCATGTTTAA 478 

AtFAE1 GGTTTGAATCCTATTCGAGAATGTTTTTGTCAAAGATAGTGGCGA-TTTTGAACCAAAGA 294 
BnFAE1 TCCAACTCCATCGCTCTCCGCGATGGTCGTTAACACTTTCAAGCTCCGAAGCAACGT 535 

AtFAE1 AAACATTTAAAAAATCAGTATCC--GGTTAC-GTTCATGCAA-ATAGAAAGTGGTCTAGG 350 
BnFAE1 AAGAAGCTTTAACCTTGGTGGCATGGGTTGTAGTGCCGGCGTTATAGCCATTGATCTAGC 595 

AtFAE1 ATCTGATTGTAATTTTAGACTTAAAGAGTCTCTTAAGATTCAATCCTGGCTGTGTACAAA 410 
BnFAE1 AAAGGACTT--GTTGCATGTCCATAAAAATACGTATGCTCTTGTGGTGAGCACAGAGAAC 653 

AtFAE1 ACTACAAATAATATAT TTTAGACTATTTGGCCTTAACTAAACTTCCACTCATTATTT 467 
BnFAE1 ATCACTTATAACATTTACGCTGGTGATAATAGGTCCATGATGGTTTCAAATTGCTTGTTC 713 

AtFAE1 ACTGAGGTTAGAGA-ATAGACTTGCGAATAAACACATTCCCGAGAAATACTCATGATCCC 526 
BnFAE1 CGTGTTGGTGGGGCCGCTATTTTGCTCTCCAACAAG--CCTGGAGATCGTAGACGGTCCA 771 

CE3 
AtFAE1 ATAATTAGTCAGAGGGTATG--CCAATCAGATCTAAGAAC[ACACATTCCCTC]AAATTTTA 584 
BnFAE1 AGTACGAGCTAGTTCACACGGTTCGAACGCATACCGGAGCTGACGACAAGTCTTTTCGTT 831 

AtFAE1 ATGCACATGTAATCATAGTTTAGCACAATTCAAAAATAATGTAGTATTAAAGACAGAAAT 644 
BnFAE1 GCGTGCAACAAGGAGACGATGAGAACGGCAAAATCGGAGTGAGTTTGTCCAAGGACATAA 891 
Figure 8 Continued: Alignment of B.n. and A.t. FAE1 promoters 



AtFAE1 TTGTAGACTTTTTTTTGGCGTTAAAGGAAGACTAAG TTTATACGTACATTTTAT 698 
BnFAE1 CCGATGTTGCTGGTCGAACGGTTAAGAAAAACATAGCAACGTTGGGTCCGTTGATTCTTC 951 

AtFAE1 T -TTAAGTGGAAAACC GAAATTTTCCATCGAAATATATGAATTTAGTATATATA 751 
BnFAE1 CGTTAAGCGAGAAACTTCTTTTTTTCGTTACCTTCATGGGCAAGAAACTTTTCAAAGATA 1011 

G-box 2 
AtFAE1 TTTCTGCAATGTACTATTTTGCTATTTTGG[CAACTTT]CAGTGGACTACTACTTTAT-TAC 810 
BnFAE1 AAATCAAACATTACTACGTCCCGGATTTCA-AACTTGCTATTGACCATTTTTGTATACAT 1070 

G-box 1 
AtFAE1 AATGTGTATGGATGCATGAGTT-TGAGTATA[CACATG]TCTAAATGCATGCTTTGCAAAAC 869 
BnFAE1 GCCGGAGGCAGAGCCGTGATTGATGTGCTAGAGAAGAACCTAGCCCTAGCACCGATCGAT 1130 

AtFAE1 GTAACGG- ACCACAAAAGAGGATCCAT GCAAATACATCTCATAGCTTCCTCCAT 922 
BnFAE1 GTAGAGGCATCAAGATCAACGTTACATAGATTTGGAAACACTTCATCTAGCTCAATATGG 1190 

AtFAE1 TATTTTCCGACACAAACAGA-GCA 945 
BnFAE1 TATGAGTTGGCATACATAGAAGCAAAAGGAAGGATGAAGAAAGGTAATAAAGTTTGGCAG 1250 

AtFAE1 
BnFAE1 ATTGCTTTAGGGTCAGGCTTTAAGTGTAACAGTGCAGTTTGGGTGGCTCTAAACAATGTC 1310 

AtFAE1 
BnFAE1 AAAGCTTCGACAAATAGTCCTTGGGAACACTGCATCGACAGATACCCGGTCAAAATTGAT 1370 

AtFAE1 
BnFAE1 TCTGATTCAGGTAAGTCAGAGACTCGTGTCCAAAACGGTCGGTCCTAATAAACGATGTTT 1430 

AtFAE1 
BnFAE1 GCTCTCTTTCGTTTCTTTTTATTTGTTATAATAATTTGATGGCTACGATGTTTCTCTTGT 1490 

AtFAE1 
BnFAE1 TTGTTATGAATAAAGAATGCAATGGTGTTCTAGTATTTGATTGTTTTACATGTATGTATC 1550 

AtFAE1 
BnFAE1 TCTTATTTACATGAAATTTTTAAACGCCTAAAAAAAAAAACGGAATTCCG 1600 